First visualization

The following visualization is trying to visualize the relationship between housing price and the size of the living space. Since the original data is too big, the data are sorted based on zipcode.

Second visualization

The following 2 graphs are indicating the number of sold houses in each month. Since the dataset only documents the data from beginning of May 2014 to ending of May 2015, we figure that make them seperate plots would be better for visualizations and comparisions. The reason why we use bar graph is that : 1. It indicates the number and the rank. 2. It’s discrete data.

Third visualization

The attached visualization is illustrating the correlation between the built year for the house and the average selling price within that year. The discrete data is arranged in chronological order. Since the data fluctuates a lot, a trendline would be considered as necessary to specify and indicate the overal trends.